AITopics | wordnet synset

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Discourse Representation Structure Parsing for Chinese

Wang, Chunliu, Zhang, Xiao, Bos, Johan

arXiv.org Artificial Intelligence, Jun-16-2023

Previous work has predominantly focused on monolingual English semantic parsing. We instead explore the feasibility of Chinese semantic parsing in the absence of labeled data for Chinese meaning representations. We describe a pipeline for automatically collecting linearized Chinese meaning representation data for sequence-to-sequence neural networks. We further propose a test suite designed explicitly for Chinese semantic parsing, which provides fine-grained evaluation of parsing performance and lets us study Chinese parsing difficulties. Our experimental results show that the difficulty of Chinese semantic parsing is mainly caused by adverbs. Realizing Chinese parsing through machine translation and an English parser yields slightly lower performance than training a model directly on Chinese data.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2306.09725

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Africa > Middle East > Egypt > Giza Governorate > Giza (0.05)
Europe > Sweden > Vaestra Goetaland > Gothenburg (0.05)
(17 more...)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Automatically constructing Wordnet synsets

Lam, Khang Nhut, Tarouti, Feras Al, Kalita, Jugal

arXiv.org Artificial Intelligence, Aug-7-2022

Manually constructing a Wordnet is a difficult task that requires years of experts' time. As a first step toward automatically constructing full Wordnets, we propose approaches to generate Wordnet synsets for both resource-rich and resource-poor languages, using publicly available Wordnets, a machine translator and/or a single bilingual dictionary. Our algorithms translate synsets of existing Wordnets to a target language T, then apply a ranking method to the translation candidates to find the best translations in T. Our approaches are applicable to any language that has at least one existing bilingual dictionary translating from English to it.

artificial intelligence, natural language, text processing, (19 more...)

arXiv.org Artificial Intelligence

doi: 10.3115/v1/p14-2018

2208.0387

Country:

Europe > Italy > Liguria > Genoa (0.05)
Africa > Middle East > Morocco (0.05)
North America > United States > Massachusetts (0.04)
(8 more...)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

TiFi: Taxonomy Induction for Fictional Domains [Extended version]

Chu, Cuong Xuan, Razniewski, Simon, Weikum, Gerhard

arXiv.org Artificial Intelligence, Jan-29-2019

Taxonomies are important building blocks of structured knowledge bases, and their construction from text sources and Wikipedia has received much attention. In this paper we focus on the construction of taxonomies for fictional domains, using noisy category systems from fan wikis or text extraction as input. Such fictional domains are archetypes of entity universes that are poorly covered by Wikipedia, as are enterprise-specific knowledge bases and highly specialized verticals. Our fiction-targeted approach, called TiFi, consists of three phases: (i) category cleaning, by identifying candidate categories that truly represent classes in the domain of interest, (ii) edge cleaning, by selecting subcategory relationships that correspond to class subsumption, and (iii) top-level construction, by mapping classes onto a subset of high-level WordNet categories. A comprehensive evaluation shows that TiFi is able to construct taxonomies for a diverse range of fictional domains such as Lord of the Rings, The Simpsons or Greek Mythology with very high precision and that it outperforms state-of-the-art baselines for taxonomy induction by a substantial margin.

category, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

1901.10263

Country:

Europe > Germany > Saarland > Saarbrücken (0.04)
Europe > Slovenia > Coastal-Karst > Municipality of Koper > Koper (0.04)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Industry:

Leisure & Entertainment (1.00)
Media > Film (0.47)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
(2 more...)

A Visual Distance for WordNet

Pérez-Arnal, Raquel, Vilalta, Armand, Garcia-Gasulla, Dario, Cortés, Ulises, Ayguadé, Eduard, Labarta, Jesus

arXiv.org Artificial Intelligence, Apr-24-2018

Measuring the distance between concepts is an important field of study in Natural Language Processing, as it can be used to improve tasks related to the interpretation of those same concepts. WordNet, which includes a wide variety of concepts associated with words (i.e., synsets), is often used as a source for computing those distances. In this paper, we explore a distance for WordNet synsets based on visual features instead of lexical ones. For this purpose, we extract the graphic features generated within a deep convolutional neural network trained on ImageNet and use those features to generate a representative of each synset. Based on those representatives, we define a distance measure between synsets, which complements the traditional lexical distances. Finally, we propose some experiments to evaluate its performance and compare it with the current state of the art.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

1804.09558

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Automatic Wordnet Development for Low-Resource Languages using Cross-Lingual WSD

Taghizadeh, Nasrin, Faili, Hesham

Journal of Artificial Intelligence Research, May-20-2016

Wordnets are an effective resource for natural language processing and information retrieval, especially for semantic processing and meaning-related tasks. So far, wordnets have been constructed for many languages. However, the automatic development of wordnets for low-resource languages has not been well studied. In this paper, an Expectation-Maximization algorithm is used to create high-quality, large-scale wordnets for poor-resource languages. The proposed method benefits from cross-lingual word sense disambiguation and develops a wordnet using only a bilingual dictionary and a monolingual corpus. The proposed method has been applied to the Persian language, and the resulting wordnet has been evaluated through several experiments. The results show that the induced wordnet has a precision score of 90% and a recall score of 35%.

synset, wordnet, wordnet synset, (15 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.4968

AI Access Foundation

11003

Country:

Asia > Middle East > Iran > Tehran Province > Tehran (0.04)
Europe > Italy > Liguria > Genoa (0.04)
Asia > South Korea (0.04)
(18 more...)

Genre:

Research Report > New Finding (1.00)
Workflow (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Approaches for Automatically Enriching Wikipedia

Syed, Zareen Saba (University of Maryland Baltimore County) | Finin, Tim (University of Maryland Baltimore County)

AAAI Conferences, Jul-8-2010

We have been exploring the use of Web-derived knowledge bases through the development of Wikitology — a hybrid knowledge base of structured and unstructured information extracted from Wikipedia augmented by RDF data from DBpedia and other Linked Open Data resources. In this paper, we describe approaches that aid in enriching Wikipedia and thus the resources that derive from Wikipedia such as the Wikitology knowledge base, DBpedia, Freebase and Powerset.

artificial intelligence, natural language, text processing, (19 more...)

AAAI Conferences

Workshops at the Twenty-Fourth AAAI Conference on Artificial Intelligence

Country:

North America > United States > Maryland > Baltimore (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Maryland > Baltimore County (0.04)
(3 more...)

Industry:

Leisure & Entertainment > Sports (0.94)
Media > Music (0.93)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.67)

Large-Scale Taxonomy Mapping for Restructuring and Integrating Wikipedia

Ponzetto, Simone Paolo (University of Heidelberg) | Navigli, Roberto (Università di Roma "La Sapienza")

AAAI Conferences, Jun-23-2009

We present a knowledge-rich methodology for disambiguating Wikipedia categories with WordNet synsets and using this semantic information to restructure a taxonomy automatically generated from the Wikipedia system of categories. We evaluate against a manual gold standard and show that both category disambiguation and taxonomy restructuring perform with high accuracy. In addition, we assess these methods on automatically generated datasets and show that we are able to effectively enrich WordNet with a large number of instances from Wikipedia. Our approach produces an integrated resource that brings together the fine-grained classification of instances in Wikipedia and a well-structured top-level taxonomy from WordNet.

category, taxonomy, wikipedia, (16 more...)

AAAI Conferences

Twenty-First International Joint Conference on Artificial Intelligence

Country:

North America > United States > Massachusetts > Middlesex County > Reading (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Word Sense Disambiguation for All Words Without Hard Labor

Zhong, Zhi (National University of Singapore) | Ng, Hwee Tou (National University of Singapore)

AAAI Conferences, Jun-23-2009

While the most accurate word sense disambiguation systems are built using supervised learning from sense-tagged data, scaling them up to all words of a language has proved elusive, since preparing a sense-tagged corpus for all words of a language is time-consuming and labor-intensive. In this paper, we propose and implement a completely automatic approach to scale up word sense disambiguation to all words of English. Our approach relies on English-Chinese parallel corpora, English-Chinese bilingual dictionaries, and automatic methods of finding synonyms of Chinese words. No additional human sense annotations or word translations are needed. We conducted a large-scale empirical evaluation on more than 29,000 noun tokens in English texts annotated in OntoNotes 2.0, based on its coarse-grained sense inventory. The evaluation results show that our approach achieves high accuracy, outperforming the first-sense baseline and coming close to a previously reported approach that requires manual effort to provide Chinese translations of English senses.

chinese translation, parallel text, synset, (15 more...)

AAAI Conferences

Twenty-First International Joint Conference on Artificial Intelligence

Country:

Asia > China > Hong Kong (0.05)
Africa > Middle East > Egypt > Giza Governorate > Giza (0.05)
Asia > Singapore (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.53)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.50)
